Overview

Dataset Statistics

Number of Variables 12
Number of Rows 2968
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 301.4 KB
Average Row Size in Memory 104.0 B
Variable Types
  • Numerical: 12
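The memory figures above agree with one another under a simple assumption: each of the 12 numerical columns stores 8-byte values, and an 8-byte row index is counted alongside them. The sketch below only rechecks that arithmetic. The 8-byte widths are an assumption, not something the report states.

```python
# Sanity check of the reported memory figures.
# Assumption (not stated in the report): 8-byte values and an 8-byte row index.
N_ROWS = 2968
N_COLS = 12
BYTES_PER_VALUE = 8   # assumed int64/float64 storage
BYTES_PER_INDEX = 8   # assumed int64 row index

# One row holds one value per column plus its index entry.
row_bytes = N_COLS * BYTES_PER_VALUE + BYTES_PER_INDEX
total_kb = N_ROWS * row_bytes / 1024

# A single column reported on its own: values plus the index.
column_kb = N_ROWS * (BYTES_PER_VALUE + BYTES_PER_INDEX) / 1024

print(f"average row size: {row_bytes} B")
print(f"total size: {total_kb:.1f} KB")
print(f"per-column size: {column_kb:.1f} KB")
```

Under these assumptions the results line up with the reported 104.0 B per row, 301.4 KB in total, and the 46.4 KB listed for each variable.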

Dataset Insights

  • qtde_products and avg_recency_days have similar distributions
  • gross_revenue is skewed
  • recency_days is skewed
  • qtde_invoices is skewed
  • qtde_items is skewed
  • qtde_products is skewed
  • avg_ticket is skewed
  • frequency is skewed
  • qtde_returns is skewed
  • avg_basket_size is skewed
  • avg_unique_basket_size is skewed
  • qtde_returns has 1481 (49.9%) zeros
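The "skewed" insights rest on the skewness coefficient γ1 reported for each variable, alongside a kurtosis value. The sketch below shows one common way to compute both from raw values, using population (biased) moments. Whether the report uses this estimator or a bias-corrected one is an assumption, and the toy data is hypothetical.

```python
from statistics import fmean

def skew_and_excess_kurtosis(values):
    """Return (g1, g2): moment-based skewness and excess kurtosis."""
    n = len(values)
    mu = fmean(values)
    # Central moments of order 2, 3 and 4 (population form, divide by n).
    m2 = sum((x - mu) ** 2 for x in values) / n
    m3 = sum((x - mu) ** 3 for x in values) / n
    m4 = sum((x - mu) ** 4 for x in values) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3.0

# Hypothetical toy data with a long right tail.
toy = [1, 1, 2, 2, 3, 3, 4, 50]
g1, g2 = skew_and_excess_kurtosis(toy)
print(f"skewness={g1:.3f}, excess kurtosis={g2:.3f}")
```

A positive g1 indicates a right tail, which is the direction reported for every variable here. A uniform distribution has an excess kurtosis of about -1.2, which is close to the -1.2062 reported for customer_id, so the report's kurtosis is most likely the excess form.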

Variables


customer_id

numerical

Approximate Distinct Count 2968
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 15270.377
Minimum 12347
Maximum 18287
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • customer_id is skewed right (γ1 = 0.0322)

Quantile Statistics

Minimum 12347
5-th Percentile 12619.35
Q1 13798.75
Median 15220.5
Q3 16768.5
95-th Percentile 17964.65
Maximum 18287
Range 5940
IQR 2969.75

Descriptive Statistics

Mean 15270.377
Standard Deviation 1719.1445
Variance 2.9555e+06
Sum 4.5322e+07
Skewness 0.03218
Kurtosis -1.2062
Coefficient of Variation 0.1126
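Several of the figures in each variable block are derived from others in the same block. As a quick consistency check, the sketch below recomputes range, IQR, variance, sum, and coefficient of variation for customer_id from the reported values. The same relationships should hold for every variable, up to rounding in the report.

```python
# Recompute derived statistics for customer_id from the reported values.
n_rows = 2968
minimum, maximum = 12347, 18287
q1, q3 = 13798.75, 16768.5
mean, std = 15270.377, 1719.1445

value_range = maximum - minimum   # spread between the extremes
iqr = q3 - q1                     # spread of the middle 50% of rows
variance = std ** 2               # square of the standard deviation
total = mean * n_rows             # sum implied by the rounded mean
cv = std / mean                   # coefficient of variation (unitless)

print(value_range, iqr)
print(f"{variance:.4e}", f"{total:.4e}", round(cv, 4))
```

Since customer_id is an identifier, these statistics carry little analytical meaning for it. The block is only a check on the arithmetic.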

gross_revenue

numerical

Approximate Distinct Count 2953
Approximate Unique (%) 99.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 2693.4851
Minimum 6.2
Maximum 279138.02
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • gross_revenue is skewed right (γ1 = 17.6265)

Quantile Statistics

Minimum 6.2
5-th Percentile 229.7325
Q1 570.845
Median 1085.51
Q3 2306.905
95-th Percentile 7169.562
Maximum 279138.02
Range 279131.82
IQR 1736.06

Descriptive Statistics

Mean 2693.4851
Standard Deviation 10135.4653
Variance 1.0273e+08
Sum 7.9943e+06
Skewness 17.6265
Kurtosis 396.6303
Coefficient of Variation 3.763
  • gross_revenue is not normally distributed (p-value 4.949410597802456e-25)
  • gross_revenue has 268 outliers
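The report does not say which rule produced the outlier count. A common convention is Tukey's fences, which flag values more than 1.5 × IQR beyond the quartiles. The sketch below applies that rule to the quartiles reported for gross_revenue. The rule, the helper name `tukey_fences`, and the multiplier `k` are all assumptions for illustration.

```python
def tukey_fences(q1: float, q3: float, k: float = 1.5) -> tuple[float, float]:
    """Return (lower, upper) fences placed k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Quartiles reported for gross_revenue.
lower, upper = tukey_fences(570.845, 2306.905)
print(f"lower fence: {lower:.3f}, upper fence: {upper:.3f}")
```

With these quartiles the lower fence is negative, below the reported minimum of 6.2, so under this rule only high-revenue customers could be flagged. Counting the rows above the upper fence in the raw data would show whether this rule reproduces the reported 268.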

recency_days

numerical

Approximate Distinct Count 272
Approximate Unique (%) 9.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 64.3093
Minimum 0
Maximum 373
Zeros 33
Zeros (%) 1.1%
Negatives 0
Negatives (%) 0.0%
  • recency_days is skewed right (γ1 = 1.7971)

Quantile Statistics

Minimum 0
5-th Percentile 2
Q1 11
Median 31
Q3 81
95-th Percentile 242
Maximum 373
Range 373
IQR 70

Descriptive Statistics

Mean 64.3093
Standard Deviation 77.7609
Variance 6046.7611
Sum 190870
Skewness 1.7971
Kurtosis 2.7698
Coefficient of Variation 1.2092
  • recency_days is not normally distributed (p-value 9.89237738834044e-12)
  • recency_days has 286 outliers

qtde_invoices

numerical

Approximate Distinct Count 56
Approximate Unique (%) 1.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 5.7244
Minimum 1
Maximum 206
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • qtde_invoices is skewed right (γ1 = 10.7601)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 2
Median 4
Q3 6
95-th Percentile 17
Maximum 206
Range 205
IQR 4

Descriptive Statistics

Mean 5.7244
Standard Deviation 8.8578
Variance 78.4599
Sum 16990
Skewness 10.7601
Kurtosis 190.463
Coefficient of Variation 1.5474
  • qtde_invoices is not normally distributed (p-value 7.384048226847717e-24)
  • qtde_invoices has 235 outliers

qtde_items

numerical

Approximate Distinct Count 1670
Approximate Unique (%) 56.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 1582.1044
Minimum 1
Maximum 196844
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • qtde_items is skewed right (γ1 = 18.7282)

Quantile Statistics

Minimum 1
5-th Percentile 102.35
Q1 296
Median 640
Q3 1399.5
95-th Percentile 4403.25
Maximum 196844
Range 196843
IQR 1103.5

Descriptive Statistics

Mean 1582.1044
Standard Deviation 5705.2914
Variance 3.255e+07
Sum 4.6957e+06
Skewness 18.7282
Kurtosis 515.8697
Coefficient of Variation 3.6061
  • qtde_items is not normally distributed (p-value 4.6616686944915825e-25)
  • qtde_items has 258 outliers

qtde_products

numerical

Approximate Distinct Count 341
Approximate Unique (%) 11.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 79.3494
Minimum 1
Maximum 1786
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • qtde_products is skewed right (γ1 = 6.3871)

Quantile Statistics

Minimum 1
5-th Percentile 7
Q1 26
Median 52
Q3 101
95-th Percentile 233.65
Maximum 1786
Range 1785
IQR 75

Descriptive Statistics

Mean 79.3494
Standard Deviation 96.8613
Variance 9382.1141
Sum 235509
Skewness 6.3871
Kurtosis 82.2623
Coefficient of Variation 1.2207
  • qtde_products is not normally distributed (p-value 1.141470033401305e-16)
  • qtde_products has 196 outliers

avg_ticket

numerical

Approximate Distinct Count 2965
Approximate Unique (%) 99.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 32.9943
Minimum 2.1506
Maximum 4453.43
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_ticket is skewed right (γ1 = 25.1443)

Quantile Statistics

Minimum 2.1506
5-th Percentile 4.9159
Q1 13.1181
Median 17.9534
Q3 24.9818
95-th Percentile 90.0521
Maximum 4453.43
Range 4451.2794
IQR 11.8637

Descriptive Statistics

Mean 32.9943
Standard Deviation 119.5321
Variance 14287.9147
Sum 97926.9539
Skewness 25.1443
Kurtosis 811.5938
Coefficient of Variation 3.6228
  • avg_ticket is not normally distributed (p-value 4.477639253249223e-25)
  • avg_ticket has 345 outliers

avg_recency_days

numerical

Approximate Distinct Count 1258
Approximate Unique (%) 42.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 67.3021
Minimum 1
Maximum 366
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_recency_days is skewed right (γ1 = 2.065)

Quantile Statistics

Minimum 1
5-th Percentile 8
Q1 25.9173
Median 48.2679
Q3 85.3333
95-th Percentile 200.65
Maximum 366
Range 365
IQR 59.416

Descriptive Statistics

Mean 67.3021
Standard Deviation 63.5054
Variance 4032.9306
Sum 199752.7303
Skewness 2.065
Kurtosis 4.8978
Coefficient of Variation 0.9436
  • avg_recency_days is not normally distributed (p-value 0.00033446659552035184)
  • avg_recency_days has 211 outliers

frequency

numerical

Approximate Distinct Count 1225
Approximate Unique (%) 41.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 0.1138
Minimum 0.00545
Maximum 17
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • frequency is skewed right (γ1 = 24.8643)

Quantile Statistics

Minimum 0.00545
5-th Percentile 0.008894
Q1 0.01634
Median 0.0259
Q3 0.04948
95-th Percentile 1
Maximum 17
Range 16.9946
IQR 0.03314

Descriptive Statistics

Mean 0.1138
Standard Deviation 0.4082
Variance 0.1666
Sum 337.8545
Skewness 24.8643
Kurtosis 987.3989
Coefficient of Variation 3.5862
  • frequency is not normally distributed (p-value 5.594865226411614e-25)
  • frequency has 371 outliers

qtde_returns

numerical

Approximate Distinct Count 213
Approximate Unique (%) 7.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 34.8885
Minimum 0
Maximum 9014
Zeros 1481
Zeros (%) 49.9%
Negatives 0
Negatives (%) 0.0%
  • qtde_returns is skewed right (γ1 = 21.9643)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 1
Q3 9
95-th Percentile 100
Maximum 9014
Range 9014
IQR 9

Descriptive Statistics

Mean 34.8885
Standard Deviation 282.8648
Variance 80012.486
Sum 103549
Skewness 21.9643
Kurtosis 595.1961
Coefficient of Variation 8.1077
  • qtde_returns is not normally distributed (p-value 4.288388586229661e-25)
  • qtde_returns has 416 outliers

avg_basket_size

numerical

Approximate Distinct Count 1978
Approximate Unique (%) 66.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 236.2529
Minimum 1
Maximum 6009.3333
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_basket_size is skewed right (γ1 = 7.698)

Quantile Statistics

Minimum 1
5-th Percentile 44
Q1 103.2375
Median 172.2917
Q3 281.5481
95-th Percentile 599.58
Maximum 6009.3333
Range 6008.3333
IQR 178.3106

Descriptive Statistics

Mean 236.2529
Standard Deviation 283.8932
Variance 80595.3471
Sum 701198.5657
Skewness 7.698
Kurtosis 102.6066
Coefficient of Variation 1.2016
  • avg_basket_size is not normally distributed (p-value 4.444268239195183e-16)
  • avg_basket_size has 178 outliers

avg_unique_basket_size

numerical

Approximate Distinct Count 906
Approximate Unique (%) 30.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 46.4 KB
Mean 17.49
Minimum 0.2
Maximum 259
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • avg_unique_basket_size is skewed right (γ1 = 3.4347)

Quantile Statistics

Minimum 0.2
5-th Percentile 2
Q1 7.6667
Median 13.6
Q3 22.1446
95-th Percentile 46
Maximum 259
Range 258.8
IQR 14.478

Descriptive Statistics

Mean 17.49
Standard Deviation 15.4601
Variance 239.0155
Sum 51910.2518
Skewness 3.4347
Kurtosis 29.2733
Coefficient of Variation 0.8839
  • avg_unique_basket_size is not normally distributed (p-value 1.939527754771593e-11)
  • avg_unique_basket_size has 176 outliers
